Learning to Minimise Regret in Route Choice

نویسندگان

  • Gabriel de Oliveira Ramos
  • Bruno Castro da Silva
  • Ana L. C. Bazzan
چکیده

Reinforcement learning (RL) is a challenging task, especially in highly competitive multiagent scenarios. We consider the route choice problem, in which self-interested drivers aim at choosing routes that minimise their travel times. Employing RL here is challenging because agents must adapt to each others’ decisions. In this paper, we investigate how agents can overcome such condition by minimising the regret associated with their decisions. Regret measures how much worse an agent performs on average compared to the best fixed action in hindsight. We present a simple yet effective regret-minimising algorithm to address this scenario. To this regard, we introduce the action regret, which measures the performance of each route in comparison to the best one, and employ it as reinforcement signal. Given that agents do not know the cost of all routes (except for the currently taken ones) in advance, we also devise a method through which they can estimate the action regret. We analyse the theoretical properties of our method and prove it minimises the agents’ regret by means of the action regret. Furthermore, we provide formal guarantees on the agents’ convergence to a φ-approximate User Equilibrium, where φ is the bound on the agents’ regret. To the best of our knowledge, this is the first work in which RL-agents are formally proven to converge to an approximate UE, without further assumptions, in the context of route choice.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Minimising Regret in Route Choice

The use of reinforcement learning (RL) in multiagent scenarios is challenging. I consider the route choice problem, where drivers must choose routes that minimise their travel times. Here, selfish RL-agents must adapt to each others’ decisions. In this work, I show how the agents can learn (with performance guarantees) by minimising the regret associated with their decisions, thus achieving the...

متن کامل

On Estimating Action Regret and Learning From It in Route Choice

The notion of regret has been extensively employed to measure the performance of reinforcement learning agents. The regret of an agent measures how much worse it performs following its current policy in comparison to following the best possible policy. As such, measuring regret requires complete knowledge of the environment. However, such an assumption is not realistic in most multiagent scenar...

متن کامل

Risk Route Choice Analysis and the Equilibrium Model under Anticipated Regret Theory

The assumption about travellers’ route choice behaviour has major influence on the traffic flow equilibrium analysis. Previous studies about the travellers’ route choice were mainly based on the expected utility maximization theory. However, with the gradually increasing knowledge about the uncertainty of the transportation system, the researchers have realized that there is much constraint in ...

متن کامل

Relationship between impulsive choice and emotional distress with students’ procrastination: mediating role of anticipated regret and consideration of future consequences

Given the wide prevalence of procrastination and delaying tasks and the need to identify factors affecting this problem, present study aimed to investigate the mediating role of anticipated regret and consideration of future consequences in the relationship between impulsive choice and emotional distress with procrastination. In an analytical cross-sectional study, 400 students were selected th...

متن کامل

The role of regret minimisation in lifestyle choices affecting the risk of coronary heart disease.

This paper introduces the discrete choice model-paradigm of Random Regret Minimisation (RRM) to the field of health economics. The RRM is a regret-based model that explores a driver of choice different from the traditional utility-based Random Utility Maximisation (RUM). The RRM approach is based on the idea that, when choosing, individuals aim to minimise their regret-regret being defined as w...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017